Overview

Dataset Statistics

Number of Variables 11
Number of Rows 2969
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 278.3 KB
Average Row Size in Memory 96.0 B
Variable Types
  • Numerical: 11
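
The memory figures above can be cross-checked with simple arithmetic. The sketch below uses only numbers printed in this report and assumes that 1 KB means 1024 bytes, which the report does not state.

```python
# Figures copied from the Dataset Statistics and per-variable blocks.
n_rows = 2969
avg_row_bytes = 96.0   # "Average Row Size in Memory"
column_kb = 46.4       # "Memory Size" reported for each variable

# Assumption: 1 KB = 1024 bytes (not stated in the report).
total_kb = n_rows * avg_row_bytes / 1024
print(f"rows x average row size = {total_kb:.1f} KB")

# Implied storage per value within a single variable's "Memory Size".
bytes_per_value = column_kb * 1024 / n_rows
print(f"bytes per value in one column = {bytes_per_value:.1f}")
```

The first figure reproduces the reported 278.3 KB. The second comes out at about 16 bytes per value. One plausible reading is an 8-byte number plus an 8-byte index entry, but the report does not confirm this.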

Dataset Insights

  • gross_revenue is skewed
  • recency_days is skewed
  • invoice_no is skewed
  • quantity is skewed
  • avg_ticket is skewed
  • frequency is skewed
  • qtde_returns is skewed
  • avg_basket_size is skewed
  • avg_unique_basket_size is skewed
  • qtde_returns has 1481 (49.88%) zeros

Variables


customer_id

numerical

Approximate Distinct Count 2969
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 15270.773
Minimum 12347
Maximum 18287
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • customer_id is skewed right (γ1 = 0.0316)

Quantile Statistics

Minimum 12347
5-th Percentile 12619.4
Q1 13799
Median 15221
Q3 16768
95-th Percentile 17964.6
Maximum 18287
Range 5940
IQR 2969

Descriptive Statistics

Mean 15270.773
Standard Deviation 1718.9903
Variance 2.9549e+06
Sum 4.5339e+07
Skewness 0.03159
Kurtosis -1.2061
Coefficient of Variation 0.1126
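
Several of the figures in each variable block are derived from others in the same block. As a minimal sketch, the following recomputes the derived values for customer_id from the quantiles, mean, and standard deviation listed above. The IQR equalling the row count (2969) is a coincidence of the arithmetic, not a copy error.

```python
# Values copied from the customer_id block.
minimum, q1, q3, maximum = 12347, 13799, 16768, 18287
mean, std = 15270.773, 1718.9903

value_range = maximum - minimum   # reported Range: 5940
iqr = q3 - q1                     # reported IQR: 2969
variance = std ** 2               # reported Variance: 2.9549e+06
cv = std / mean                   # reported Coefficient of Variation: 0.1126

print(f"Range    {value_range}")
print(f"IQR      {iqr}")
print(f"Variance {variance:.4e}")
print(f"CV       {cv:.4f}")
```

The same identities (Range = Maximum − Minimum, IQR = Q3 − Q1, Variance = Standard Deviation², Coefficient of Variation = Standard Deviation / Mean) apply to every other variable in the report.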

gross_revenue

numerical

Approximate Distinct Count 2954
Approximate Unique (%) 99.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 2749.3217
Minimum 6.2
Maximum 279138.02
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • gross_revenue is skewed right (γ1 = 16.7691)

Quantile Statistics

Minimum 6.2
5-th Percentile 229.77
Q1 570.96
Median 1086.92
Q3 2308.06
95-th Percentile 7219.68
Maximum 279138.02
Range 279131.82
IQR 1737.1

Descriptive Statistics

Mean 2749.3217
Standard Deviation 10580.6233
Variance 1.1195e+08
Sum 8.1627e+06
Skewness 16.7691
Kurtosis 353.3469
Coefficient of Variation 3.8484
  • gross_revenue is not normally distributed (p-value 4.949126309702442e-25)
  • gross_revenue has 269 outliers
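
The report does not say how outliers are identified. As an illustration only, the sketch below computes the fences of the common 1.5 × IQR rule (Tukey's rule) from the gross_revenue quartiles. The multiplier `k = 1.5` is an assumption and may not match the rule behind the reported count.

```python
# Quartiles and extremes copied from the gross_revenue block.
q1, q3 = 570.96, 2308.06
minimum, p95 = 6.2, 7219.68

k = 1.5  # assumption: conventional Tukey multiplier, not confirmed by the report
iqr = q3 - q1
lower_fence = q1 - k * iqr
upper_fence = q3 + k * iqr

print(f"lower fence {lower_fence:.2f}")
print(f"upper fence {upper_fence:.2f}")
print("any low-side outliers possible:", minimum < lower_fence)
print("95th percentile beyond upper fence:", p95 > upper_fence)
```

Under this assumed rule the lower fence is negative, so every outlier would lie on the high side. The 95th percentile already exceeds the upper fence, which is consistent with more than 5% of rows being flagged.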

recency_days

numerical

Approximate Distinct Count 272
Approximate Unique (%) 9.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 64.2876
Minimum 0
Maximum 373
Zeros 34
Zeros (%) 1.1%
Negatives 0
Negatives (%) 0.0%
  • recency_days is skewed right (γ1 = 1.7975)

Quantile Statistics

Minimum 0
5-th Percentile 2
Q1 11
Median 31
Q3 81
95-th Percentile 242
Maximum 373
Range 373
IQR 70

Descriptive Statistics

Mean 64.2876
Standard Deviation 77.7568
Variance 6046.1167
Sum 190870
Skewness 1.7975
Kurtosis 2.7713
Coefficient of Variation 1.2095
  • recency_days is not normally distributed (p-value 9.457436240821924e-12)
  • recency_days has 286 outliers

invoice_no

numerical

Approximate Distinct Count 56
Approximate Unique (%) 1.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 5.7231
Minimum 1
Maximum 206
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • invoice_no is skewed right (γ1 = 10.7614)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 4
Q3 6
95-th Percentile 17
Maximum 206
Range 205
IQR 4

Descriptive Statistics

Mean 5.7231
Standard Deviation 8.8565
Variance 78.4381
Sum 16992
Skewness 10.7614
Kurtosis 190.5112
Coefficient of Variation 1.5475
  • invoice_no is not normally distributed (p-value 7.36579815170809e-24)
  • invoice_no has 235 outliers

quantity

numerical

Approximate Distinct Count 49
Approximate Unique (%) 1.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 11.6352
Minimum 1
Maximum 102
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • quantity is skewed right (γ1 = 3.0978)

Quantile Statistics

Minimum 1
5-th Percentile 4
Q1 8
Median 11
Q3 14
95-th Percentile 22
Maximum 102
Range 101
IQR 6

Descriptive Statistics

Mean 11.6352
Standard Deviation 6.2738
Variance 39.3612
Sum 34545
Skewness 3.0978
Kurtosis 25.291
Coefficient of Variation 0.5392
  • quantity is not normally distributed (p-value 2.6753494818214517e-09)
  • quantity has 95 outliers

avg_ticket

numerical

Approximate Distinct Count 2966
Approximate Unique (%) 99.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 51.8978
Minimum 2.1506
Maximum 56157.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_ticket is skewed right (γ1 = 53.4172)

Quantile Statistics

Minimum 2.1506
5-th Percentile 4.9167
Q1 13.1193
Median 17.9566
Q3 24.9883
95-th Percentile 90.497
Maximum 56157.5
Range 56155.3494
IQR 11.869

Descriptive Statistics

Mean 51.8978
Standard Deviation 1036.9344
Variance 1.0752e+06
Sum 154084.4539
Skewness 53.4172
Kurtosis 2885.8393
Coefficient of Variation 19.9803
  • avg_ticket is not normally distributed (p-value 4.226613732775838e-25)
  • avg_ticket has 346 outliers

avg_recency_days

numerical

Approximate Distinct Count 1258
Approximate Unique (%) 42.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 67.3485
Minimum 1
Maximum 366
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_recency_days is skewed right (γ1 = 2.0617)

Quantile Statistics

Minimum 1
5-th Percentile 8
Q1 25.9231
Median 48.2857
Q3 85.3333
95-th Percentile 201
Maximum 366
Range 365
IQR 59.4103

Descriptive Statistics

Mean 67.3485
Standard Deviation 63.5449
Variance 4037.958
Sum 199957.7303
Skewness 2.0617
Kurtosis 4.8769
Coefficient of Variation 0.9435
  • avg_recency_days is not normally distributed (p-value 0.00033409451870142833)
  • avg_recency_days has 212 outliers

frequency

numerical

Approximate Distinct Count 1225
Approximate Unique (%) 41.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 0.1138
Minimum 0.00545
Maximum 17
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • frequency is skewed right (γ1 = 24.8679)

Quantile Statistics

Minimum 0.00545
5-th Percentile 0.008894
Q1 0.01634
Median 0.02589
Q3 0.04945
95-th Percentile 1
Maximum 17
Range 16.9946
IQR 0.03311

Descriptive Statistics

Mean 0.1138
Standard Deviation 0.4082
Variance 0.1666
Sum 337.8642
Skewness 24.8679
Kurtosis 987.6977
Coefficient of Variation 3.5867
  • frequency is not normally distributed (p-value 5.59374614723187e-25)
  • frequency has 371 outliers

qtde_returns

numerical

Approximate Distinct Count 214
Approximate Unique (%) 7.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 62.157
Minimum 0
Maximum 80995
Zeros 1481
Zeros (%) 49.9%
Negatives 0
Negatives (%) 0.0%
  • qtde_returns is skewed right (γ1 = 51.7716)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 1
Q3 9
95-th Percentile 100.6
Maximum 80995
Range 80995
IQR 9

Descriptive Statistics

Mean 62.157
Standard Deviation 1512.4961
Variance 2.2876e+06
Sum 184544
Skewness 51.7716
Kurtosis 2760.8715
Coefficient of Variation 24.3335
  • qtde_returns is not normally distributed (p-value 4.227221068494339e-25)
  • qtde_returns has 417 outliers
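
The zero count and the quantiles of qtde_returns fit together. A short check, using only the row count and zero count from this report: with an odd number of rows the median is the middle sorted value, and its position falls just past the block of zeros.

```python
# Counts copied from the report.
n_rows = 2969
zeros = 1481

zero_share = 100 * zeros / n_rows
print(f"zero share = {zero_share:.2f}%")

# With an odd row count, the median is the value at this 1-based sorted position.
median_position = (n_rows + 1) // 2
median_is_positive = median_position > zeros
print(f"median position = {median_position}, beyond the zeros: {median_is_positive}")
```

Since the zeros occupy sorted positions 1 to 1481 and the median sits at position 1485, the median must be positive (reported: 1), while the 5th percentile and Q1 fall inside the zeros (reported: 0).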

avg_basket_size

numerical

Approximate Distinct Count 1979
Approximate Unique (%) 66.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 249.8138
Minimum 1
Maximum 40498.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_basket_size is skewed right (γ1 = 44.6501)

Quantile Statistics

Minimum 1
5-th Percentile 44
Q1 103.25
Median 172.3333
Q3 281.6923
95-th Percentile 600
Maximum 40498.5
Range 40497.5
IQR 178.4423

Descriptive Statistics

Mean 249.8138
Standard Deviation 791.5552
Variance 626559.6179
Sum 741697.0657
Skewness 44.6501
Kurtosis 2251.7395
Coefficient of Variation 3.1686
  • avg_basket_size is not normally distributed (p-value 4.3126973261850275e-25)
  • avg_basket_size has 179 outliers

avg_unique_basket_size

numerical

Approximate Distinct Count 1005
Approximate Unique (%) 33.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 22.1547
Minimum 1
Maximum 299.7059
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_unique_basket_size is skewed right (γ1 = 3.4977)

Quantile Statistics

Minimum 1
5-th Percentile 3.3455
Q1 10
Median 17.2
Q3 27.75
95-th Percentile 56.94
Maximum 299.7059
Range 298.7059
IQR 17.75

Descriptive Statistics

Mean 22.1547
Standard Deviation 19.5123
Variance 380.7307
Sum 65777.3287
Skewness 3.4977
Kurtosis 27.6546
Coefficient of Variation 0.8807
  • avg_unique_basket_size is not normally distributed (p-value 2.832635829597603e-11)
  • avg_unique_basket_size has 172 outliers
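
The outlier counts are easier to compare as a share of the 2969 rows. The sketch below only restates counts printed in the variable blocks above. customer_id is omitted because the report gives no outlier count for it.

```python
n_rows = 2969

# Outlier counts copied from the per-variable insights.
outliers = {
    "gross_revenue": 269,
    "recency_days": 286,
    "invoice_no": 235,
    "quantity": 95,
    "avg_ticket": 346,
    "avg_recency_days": 212,
    "frequency": 371,
    "qtde_returns": 417,
    "avg_basket_size": 179,
    "avg_unique_basket_size": 172,
}

shares = {name: 100 * count / n_rows for name, count in outliers.items()}

# Print variables from the highest to the lowest outlier share.
for name, share in sorted(shares.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:<24}{outliers[name]:>5}{share:>8.2f}%")
```

qtde_returns has the largest share of flagged rows and quantity the smallest.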
